Markov decision processes (MDPs), also called stochastic dynamic programs, were first studied in the 1950s. MDPs can be ...